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Abstract 

■^ . Given a field F, an integer n > 1, and a matrix A S Mn{F), are there polynomials 

P^ I f^g € F[X], with / monic of degree n, such that A is similar to g{Cf), where C/ is 

fH ■ the companion matrix of /? For infinite fields the answer is easily seen to positive, so we 

concentrate on finite fields. In this case we give an affirmative answer, provided \F\ > n—2. 
Moreover, for any finite field F, with \F\ = m, we construct a matrix A S Mm+siF) that 
i_i' is not similar to any matrix of the form g{Cf). 

Of use above, but also of independent interest, is a constructive procedure to deter- 

^ I mine the similarity type of any given matrix g{Cf) purely in terms of / and g, without 

Tlj" ' resorting to polynomial roots in F or in any extension thereof. This, in turn, yields an 

^^ . algorithm that, given g and the invariant factors of any A, returns the elementary divi- 

,__! I sors of g{A). It is a rational procedure, as opposed to the classical method that uses the 

^ ■ Jordan decomposition of A to find that of g{A). 

f^ , Finally, extending prior results by the authors, we show that for an integrally closed 

cn I ring R with field of fractions F and companion matrices C, D the subalgebra R{C, D) 

. . • of Mn{R) is a free -R- module of rank n + (n — m)[n — 1), where m is the degree of 

^^ , gcd(/, 5) G -^[AT], and a presentation for R{C,D) is given in terms of C and D. A 

^ I counterexample is furnished to show that R{C, D) need not be a free iZ-module if R is 

H ' not integrally closed. The preceding information is used to study Mn{R), and others, as 

- - -' i?[X] -modules. 
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1 Introduction 



We fix throughout the paper a field F and an integer n > 1. Given a monic polynomial / 

fiX + ■■■ + /„_iX"-i + X" G F[X], its companion matrix Cj G M„(F) is defined: 
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Companion matrices play a prominent role in linear algebra. Indeed, any matrix A G Mn{F) is 
similar to a unique direct sum C^j © • • • © Cq^, where 1 / (Zil • • • kr are the invariant factors of A. 
Refining this decomposition results in A being similar to Ap-^ © • • • © Ap^, where P = {pi, . . . ,Ps} is 
the set of monic irreducible factors of the minimal polynomial of A and, for p £ P, the component Ap 



is similar to Cpai 



(-^p"-t 



for unique 1 < ai • • • < a^. The powers p' 



fli 



,p"' are known as the 
This 



p-elementary divisors of A, and as p runs through P one obtains all elementary divisors of A 
material is classical and can be found, for instance, in [J]. 

Companion matrices and their applications are featured extensively in the mathematical literature, 
not only in linear algebra, but also elsewhere. The list is too long to describe here, so we restrict 
ourselves to a sample of cases. 

In linear algebra itself, [DW] gives a polar decomposition for Cf, [KS] provides a singular value 
decomposition as well as the Qi?- factorization for Cf, while the the numerical range of Cf is studied in 
[GW] . A new list of representatives under matrix similarity was recently given in [D] using companion 
matrices. 

Since Cf is the "universal root" of /, companion matrices are naturally associated to simple 
algebraic field extensions. Not long ago, [GS] gave an application in this direction. Let a G C be 
an algebraic number whose minimal polynomial over Q is assumed to be /. Then 1, a, ... , a"~^ is a 
Q-basis of the field Q[a]. If /3,7 G Q[a] then [GS] gives a closed formula for the coordinates of f3'j in 
terms of the individual coordinates of /3 and 7 and the companion matrix Cf. The formula extends 
beyond the field context to arbitrary commutative rings with identity. 

For applications of companion matrices to the solution of differential equations see [B2], to dy- 
namical systems see [LS], to quasicrystals see [GV], to modular representation theory see [N], to the 
irreducibility of polynomials see [Gu] , to the study of zeroes of polynomials see [CN] . 

Given monic polynomials f,g of degree n over a commutative ring with identity R, let C and 
D stand for their respective companion matrices. Recently, [GS2] studied the subalgebra R{C, D) of 
Mn{R) generated by C and D. It is shown in [GS2] that R{C,D) always coincides with R[C,D], the 
i?-span of all matrices C^D^ , where < i,j < n. Moreover, necessary and sufficient conditions for 
C, D to generate Mn{R) were given, and a presentation for Mn{R) in terms of C and D was furnished 
in that case. Furthermore, if i? = Z and /, g are relatively prime then the sublattice Z(C, D) of M„(Z) 
was shown to have full rank v? and the exact value of the finite index [M„(Z) : 7j{C,D)\ was found 
to be \R{f,g)\^~^, where R{f,g) is the resultant to / and g. 

A matrix A G Mn{F) is said to be cyclic if it is similar to a companion matrix, i.e., if the minimal 
and characteristic polynomials of A coincide. The terminology arises from the fact that when one 
views the column space V = F" as an F [AT] -module via A, this module has a cyclic generator if and 



only if the matrix A is cyclic. We will say that A G Mn{F) is of polynomial type if A is equal to a 
polynomial in a cyclic matrix. Since the only matrices commuting with a cyclic matrix are are its 
own polynomials, being of polynomial type means commuting with a cyclic matrix. Alternatively, 
A G Mn{F) is of polynomial type if and only if A is similar to a matrix g{Cf) for suitable polynomials 
f,gG F[X], with / monic of degree n. 

In 1995 Neumann and Praeger [NP] studied the proportion of cyclic matrices in Mn{F) when F 
is a finite field. This initiated a long series of papers (see [W], [NP2], [Br], [FNP], for instance), that 
study the probability of a matrix A to be cyclic when it belongs to a classical linear group defined 
over a finite field. Now, it is shown in [NP] that the probability that a matrix be cyclic is high. In 
this paper we consider the following question: is every matrix A G Mn{F) of polynomial type? The 
answer is easily seen to be positive for infinite fields, so we concentrate on the case when F is finite. 
In this case Theorem 3.8 gives an affirmative answer when |F| > n — 2. This is actually a consequence 
of the more general Theorem 3.5, which gives sufficient conditions for a matrix A of arbitrary size n 
over F to be of polynomial type, depending on the nature of the elementary divisors of A and the size 
of F. For a while it was unclear whether the bound \F\ > n — 2 was the result of the inadequacy of 
our methods or a counterexample really existed. Eventually the mystery was dispelled: this bound 
is as sharp as possible. Indeed, for every finite field F, say of size \F\ = m, we construct a matrix 
A £ Mm+3{F) that is not of polynomial type. Many more examples, of both positive and negative 
nature, are given throughout the paper. 

A basic tool used to deal with matrices of polynomial type is given in section 2, which is of 
independent interest. We furnish a constructive procedure to determine the similarity type of g{Cf) 
purely in terms of / and g. No appeal is made to roots of / in F or in any extension thereof. This 
naturally extends to an algorithm that yields the elementary divisors of any g{A) from g and the 
invariant factors of A, without explicitly computing g{A). It is a rational procedure, as opposed to 
the classical method that uses the Jordan decomposition of A to find that of g{A), as found in [Mj. 
As a corollary we obtain constructive criteria for g{A) to be semisimple, cyclic or diagonalizable. 

At the end of the paper we direct attention to other results from [GS2] . If i? is a unique factorization 
domain then [GS2] shows that R{C, D) is necessarily a free i?-module of rank n + [n — m){n — 1), and 
gives a presentation for R{C, D) in terms of C, D. Here we push these results under the much weaker 
hypothesis that R be integrally closed. We also show that \i R = Z[v5] then R{C,D) is not a free 
i?-module for suitable choices of / and g. 



2 Similarity type of g{A 



Given g G F[X] and A £ Mn{F) we wish to determine the elementary divisors of g{A) without 
computing g{A), finding roots or leaving the field F in any way. 

We start by making some observations about the determination of the similarity class of an arbi- 
trary matrix A G Mn{F). The interested reader may consult [J] for further information. 

We can make the column space V = F^ into an F[X]-module via A as follows: 

yv = y{A)v, yeF[X],veV. 
For y G F[X] we define 

V{y, A) = {v e V \y ■ V = 0} = {v e V \ y{A)v = 0}. 



Suppose p £ F[X] is irreducible of degree k. Then K = F[X]/{p) is a field and an F-vector space of 
dimension k. Moreover, V{p,A) is a i^-vector space and so is V{p^'^^,A)/V{p^,A) for every i > 1. It 
follows that the dimension of V{p'^,A) over F is a multiple of k for every i > 1. Here V{p^,A) = {0} 
and V{p, A) ^ {0} if and only if p is a factor of the minimal polynomial q of A. Let q have prime 
factorization q = p^^ ■ ■ -p^, where P = {pi, . . . ,Pm} is the set of of monic irreducible factors of q and 
ei, . . . , Cm > 1- Let p £ P have multiplicity e in q, and define the sequence do,di,d2, ■ ■ ■ by 

di = dimpiVip', A))/deg{p), i > 0. 

Here do = and each di, for i > 1, is a positive integer, as explained above. Note also that F(p^) = 
y^pG+ij = • • • , so the sequence dQ,di,d2, ■ ■ ■ stabilizes at dg. 

Claim 2.1 The similarity type of A is completely determined by the m sequences di,d2, ■ ■ ■ corre- 
sponding to each of the polynomials pi, . . . , pm ■ 

Proof. Let p £ P have multiplicity e in q. For each i > 1 let Oj stand for the number of elementary 
divisors of A of the form p^ with k > i and let bi stand for the number of the form p*. Clearly 
6i = oi — a2, 62 = 02 — 03, 63 = 03 — 04 ... . On the other hand, looking at the decomposition of A into 
elementary divisors, we see that 

di ^ flx, (I2 — di = 0,2, U3 — U2 ^ 0-3, U4 — d^ = 04 . . . , 



so after all 



bi = 2di - d2 - do, 62 = 2^2 - ^3 - di, 63 = 2^3 - ^4 - ^2, 



I.e. 



2di - di+i - di^i, i > 1. (1) 



Since the multiplicities of the elementary divisors of A completely describe A up to similarity, our 
claim is established. ■ 

Note that since all terms from dg on are equal, it follows that bt = for i > e + 1, as expected. 

Next let / G -F'i-'^], where / is monic of degree n. We wish to apply the above ideas to determine 
the elementary divisors of g{Cf) solely in terms of / and g. We assume that we have somehow found 
the set R = {ri, . . . , r^} of monic irreducible factors of /. A polynomial time algorithm for factoring 
polynomials over the rational numbers can be found in [LLL], and over an algebraic number field 
in [L]. Factorization methods for polynomials over a finite field can be found in [Be] and [CZ]. Let 
r £ R have degree s. By the minimal polynomial of g modulo r we mean the monic generator p 
of the ideal of all y £ F[X] such that y{g{X)) is a multiple of r. To calculate p simply reduce 
the powers \,g{X),g{X)'^ , . . . modulo r. Then the first power g{XY , with t < s, that is a linear 
combination, modulo r, of those preceding it will yield the actual coefficients to construct p. Notice 
that deg(p) < deg(r) and that p is irreducible since so is r. It is perfectly possible for g{X) to have 
the same minimal polynomial modulo distinct factors r^ and rj of /. 

We claim that the monic irreducible factors of the minimal polynomial q of g{Cf) are precisely 
the minimal polynomials of g modulo the irreducible factors of /. Indeed, for y £ F[X] we have 
y{g{Cf)) = if and only if y{g{X)) = mod /, which is equivalent to y{g{X)) = mod r"% 
1 < i < k, where Ui is the multiplicity of r^ in /. This readily yields our claim. 



Having determined the monic irreducible factors p of q what we need now is a way to compute the 
dimension of V{p'^,g{Cf)) for i > 1. We will actually compute dim V{y,g{Cf)) for every y £ F[X]. 
This requires some notation. 

Let ei, . . . , e„ stand for the canonical basis of V. We write -Fn[X] for the subspace of F[X] with 
basis 1,X, . . . ,X'^~^. If p G i^n[^] then [p] stands for the coordinates oi p relative to this basis. Note 
that every vector v £ V can be written in the form v = [p] for a unique polynomial p G F„[X]. 

Lemma 2.2 Given any y G -^[-'^l; we set z = gcd{y{g{X)), f), d = deg(z), h = f/z, V{y) = 
V{y,g{Cf)) andC = Cf. Then 

(a) If p £ Fn[X] then [p] is in V{y) if and only if h\p. 

(h) The dimension ofV{y) is equal to d, i.e., 

deg gcd{y{g{X)), f) = dim V{y, g{Cf)). 

(c) If d > then a basis for V{y) is given by [h], [Xh], . . . , [X /i]. 
Proof. By definition [p] G V{y) if and only if 

y{g{C))\p]=0. 

We easily verify that [p] = p{C)ei, so the above translates into 

yig{C))p{C)ei = 0. (2) 

The polynomials that evaluated in C annihilate ei are those that annihilate C, namely the mul- 
tiples of /. Thus (2) is equivalent to f\y{g{X))p{X), i.e. h\p. This proves (a). It is clear then 
that [h], [Xh], . . . , [X /i] are all in V{y). Moreover, looking at the last 1 in each of these column 
vectors, we see that they are linearly independent. The conditions h\p and p G -Fn[-^] show that 
[h], [Xh], . . . , [X /i] span V{y), as required. ■ 

By Lemma 2.2, if p is one of the irreducible factors of the minimal polynomial of g{Cf) then 

deg gcd{p\g{X)),f) = dim V{p\g{Cf)), i > 1. 

By our preceding discussion the values on the right hand side completely determine the p-elementary 
divisors of g{Cf) and hence so do those on the left. From a computational point of view, note that 
these degrees will strictly increase until a first unavoidable repetition; all values onward will coincide, 
with no need to compute them. We have thus proved the following. 

Theorem 2.3 The similarity type of g{Cf) is completely determined by the set P of minimal 
polynomials p of g modulo the monic irreducible factors r of f and the degrees of 

gcd{p{g{X)),f), gcd{p\g{X)),f), gcd{pHg{X)), f), . . . 

Explicitly, the set of monic irreducible factors of the minimal polynomial ofg{Cf) is P and for p G P, 
if di = deg gcd{p^{g{X)), f)/deg p, i > 1, then the multiplicity hi of p^ as an elementary divisor of 
g{Cf) is given by (1). 



Observe next that if y G F[^] is an arbitrary polynomial and h = gcd(y, /) then 

gcd{y\f)=gcA{h\f), i>l. (3) 

This can be used to our advantage to ease the computation of gcdip^ {g{X)) , f) for i > 1. 

It is now easy to complete the determination of the similarity type of g{A) from that of A. We 
have A ~ C/^ ® • • • ® C/^ , where 1 / /i | • • • | /t ^^^ the invariant factors of A and ft is the minimal 
polynomial of A. Clearly the minimal polynomial of g{A) is the minimal polynomial qf of g{Cf^). We 
compute all monic irreducible factors p of qt and, for each of these, the corresponding p-elementary 
divisors of g{Cf-^), . . . , g{Cf^), as indicated above. This produces the p-elementary divisors of g{A), as 
required. 

Proposition 2.4 Suppose g{X) has degree d > 1 and leading coefficient a. Then 

a{Ca-^fig{x))) ^ Cf®---®Cf, d times 

Proof. Let r be a monic irreducible factor of / with multiplicity e, and let z be a monic irreducible 
factor of r{g{X)). The minimal polynomial of g{X) modulo z is clearly just r. As r varies in / and z 
varies in r{g{X))^ the resulting z exhaust all monic irreducible factors of a^'^ f{g{X)). Thus, the monic 
irreducible factors of the minimal polynomial of g{Ca-"f{g(x))) ^'^^ precisely those of /. Moreover, if 
1 < i < e we have 

deg gcd{r\g{X)),f{g{X))) = deg r\g{X)) =dx deg(r^) =dx deg(gcd(rS /)), 

while if i > e we have 

deg gcd{r\g{X)),f{g{X))) =dx deg(r^) =dx deg(gcd(r\ /)). 

The result now follows from Theorem 2.3. ■ 

3 Matrices of polynomial type 

Recall the following well-known result. 

Lemma 3.1 Suppose A G Mn{F) is cyclic and B commutes with A. Then B G -^[^]. 

li A,B G Mn{F) we write A ~ S to denote that A, B are similar. If F is an algebraically closed 
field, the fact that every A G Mn{F) is of polynomial type has been known for a long time. See, for 
instance, the argument given in [MT], page 111. The proof below, due to R. Guralnick, appears in 

[GR]. 

Theorem 3.2 Let F he field. Suppose all eigenvalues of A £ Mn{F) are in F, and F has at least 
as many elements as the number of Jordan blocks of A. Then A is of polynomial type. 

Proof. By hypothesis A ~ S = Jmi(ai) © • • • © Jmsio-s)-, where oi, . . . , Cs G F. By assumption F has 
at least s distinct elements 6i, ...,6s- Let D = Jmiibi) © • • • © Jmsibs)- Then B commutes with D, 
which is cyclic, so S G F[D] by Lemma 3.1, whence A is of polynomial type. ■ 
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Corollary 3.3 If F is algebraically closed then every A G Mn{F) is of polynomial type. 
The general case is not as easy. We proceed to tackle this problem. 

Lemma 3.4 Suppose yli,...,^m G Mn{F) satisfy Ai ~ gi{Cf^), where gi,---,gm G -^[-'^l; ^.''T'd 
fii- ■ ■ 1 fm G P[^] '^I'G pairwise relatively prime. Then A = Ai(B- ■ ■ (B Am is similar to g{Cf) for some 
g G F[X] and f = fi--- /„. 

Proof. By the Chinese Remainder Theorem there exists g G F[X] such that g = gi mod fi for all 
\ <i <m. It follows that 

g{Cf, e • • • e Cfj = 5i(C7jj e • • • e gm{CfJ ~ ^i e • • • e a„ = a. 

Since /i, . . . , /„ are pairwise relatively prime, their product / satisfies 

Therefore 

g{Cf)^g{Cf,®---®CfJ^A. ■ 

We consider now the subgroup il of GL2(-F) of all matrices of the form M = { ) , where 



.01, 

a,b £ F and a ^ 0. Let F'^ and F* stand for the underlying additive and multiplicative groups of F, 
respectively. We can view F~^ and F* as subgroups of fi, and in fact fi = F^ x F*. 

The group 0, acts on the polynomial algebra F[X] by means of automorphisms as follows. If / 7^ 
then 

(/*0(X)=a— /(aX + 6), m = deg(/), 

while 0*^ = 0. Note that Q preserves the class of monic irreducible polynomials. We will refer to f^ 
as an i7-conjugate of /. We will also speak of the F* and ^'''-conjugates of /. We write Sf and Tf 
for the stabilizers of / in F^ and F*, respectively. 

Suppose / has degree m > 1 and let a be a root of / in some extension of F. If 6 G F~^ then a — b 
is a root of / . Thus if 6 G S*/ it follows that a — 6 is also a root of /. Clearly this can happen for at 
most m such b, i.e. I^jj < deg(/). In particular, Sf is finite. Assume next that F is a finite field with 
q — pd gignients, with p a prime, and that / is irreducible. The roots of / are distinct from each other 
and are permuted by Sf, although not necessarily in a transitive way. We claim that at least one of 
the stabilizers Sf, Tf is trivial. Indeed, let K = F[a], a field extension of F of degree m. The Galois 
group G of K/F is cyclic of order m generated by the Frobenius automorphism g given by x 1— t- x''. 
The group G acts transitively on the roots of /. Suppose Sf is not trivial and let 7^ 6 G Sf. Then 
m > 2 and some power of g must send q to a — 6, i.e. a'' = a — b for some 1 < e < m — 1. Since 
Xi' - X + b £ F[X] annihilates a, it follows that /|X«" - X + b. Let a G Tf. We wish to show that 
a = 1. Since a^^a is a root of /", it is also a root of /, and hence of X'^ — X + b. Thus 

= {a^'^ay'' - a~^a + b = a^'^{a''" -a + ab) = a^'^{-b + ab). 

From b = ab and 6 7^ we infer a = 1, as claimed. It follows that the number of ri-conjugates of / is 
at least \F*\. 



Note that while X'^ —X + h\s F^'-invariant, its irreducible factors need not be: they are permuted 
by the elements of F+ (e.g. X'^ + X + a G i^4[X], where a^ = a + 1, is irreducible and F2-stable but 
not F4-stable, as is the product {X"^ + X + a){X'^ + X + a + I) = X^ + X + I). 

For instance, X^—X+h G ^^[X], 6 7^ 0, is irreducible, K^-invariant and has itself the required form. 
These are the only irreducible Fp'"-invariant polynomials of prime degree p. Next let F = F2. Clearly 
X^+X+1 is F2+-invariant with prime decomposition X^+X+1 = {X'^+X+1){X^+X^+X^+X^ + 1). 
Thus the irreducible polynomial X^ + X^ + X^ + X"^ + 1 is Fg^^-invariant and divides the polynomial 
X^ — X + 1, which has the anticipated form. 

Theorem 3.5 Let F be an arbitrary field and let A G Mn{F). Given a monic irreducible factor p 
of the minimal polynomial of A, let i{p) be the sum of the multiplicities of all elementary divisors of 
A which are a power of an ^.-conjugate of p. 

Suppose that |F+| > l{jp) if Sp = {0}, and \F*\ > i{p) if Sp / {0} but Tp = {!}, for every monic 
irreducible factor p of the minimal polynomial of A. Then A is of polynomial type. More precisely, 
let qi,...,qm be the elementary divisors of A, where each qi = pi'' is a positive power of a monic 
irreducible factor pi of the minimal polynomial of A (we do not assume at all here that pi, ■ ■ ■ ,Pm o,re 
distinct). Then A ~ g{Cf), for some g G F[X] and / = /i • • • fm, where each fi = r^' for a suitable 
Vt-conjugate ri of pi, and ri, . . . ,rm are distinct. 

Proof. The elementary divisors of A give a decomposition A ~ Cg^ © • • • © Cq^ , where each qi = p^"- , 
as indicated above. The result is obvious if m = 1 so we suppose tti > 2. 

Set ri = pi- Suppose 1 < s < m and we have found distinct polynomials ri, . . . , r^ such that each 
Tj is 0-conjugate oi pi. We wish to find r^+i, distinct from all ri, . . . ,rs and an fi-conjugate to Ps+i- 

Suppose first F is infinite. As explained above, the F^-stabilizer of Ps+i is finite, so Ps+i has 
infinitely many distinct F^-conjugates. Hence there exist infinitely many Q-conjugates oips+i which 
are different from the given polynomials ri, . . . , r^ and we just pick one of them to be rg+i- 

Suppose next F is finite. As shown above, one of the stabilizers Sp^^^ , Tp^^j must be trivial. If 
Sps+i = {0} then the number of F^-conjugates of Ps+i is \F^\. In the list 1, . . . , s there are less than 
i.{ps+i) indices i such that rj is fi-conjugate to Ps+i (since qs+i, which contributes to i{ps+i), has 
not been counted yet). By assumption |F+| > i{ps+i)-, so we can find an F+-conjugate rg+i to Ps+i 
different from all ri, . . . , r^. If S'p^+i / {0} then Tp^^-^ = {1} and we may argue exactly as above, using 
F* instead of F+. 

We thus produce distinct polynomials ri,...,rm, such that each r^ is ri-conjugate to pi. Since 
Vt preserves the class of monic irreducible polynomials, ri, . . . ,rm, are pairwise relatively prime, and 
hence so are their powers /i = r^S 1 < i < m. Clearly fi is il-conjugate to qi for every 1 < i < m 
(by an element that is either in F^ or in F*). It follows from Proposition 2.4 that (7^. ^ diiCfi) for a 
suitable linear polynomial gi. Now apply Lemma 3.4. ■ 

Lemma 3.6 If A is diagonalizable then A ~ diCf) for suitable f,g£ F[X]. 

Proof. We have A ~ ailmi © • • • © aklm^, where ai, . . . , a^ are distinct elements of F. Then ailm^ = 
gi{Cf^) for fi = {X — Oj)™' and gi = ai, so Lemma 3.4 applies. ■ 

Lemma 3.7 // the elementary divisors of A £ Mn{F) are {X — ai)^, {X — 02), . . . , {X — a„_i) for 
some elements oi, . . . ,a„_i G F, not necessarily distinct, then A is of polynomial type. 



Proof. We have A ~ C(x-ai)2 ©fli-^mffi^i-^mi • • '®bklmk i where oi, 61, . . . , 6^ are distinct. By Theorem 
2.3 C(x-ai)2 © ai/„, ~ g{Cf), where 5 = X^+i + ai, / = X™+2. Let 0, ci, . . . , c^ G F be distinct. 
Then bilrm = QiiPf-) for fi = {X — Ci)"^% gi = h, 1 < i < k, so Lemma 3.4 apphes. ■ 

Theorem 3.8 Let F be a field that has at least n— 2 elements and let A £ Mn{F). ThenA^g{Cf) 
for suitable polynomials f,gG F[X], with f monic of degree n. 

Proof. We may assume that we are not in any of situations described in Lemmas 3.6 and 3.7. We may 
also assume that A itself is not similar to a companion matrix. 

Let P = {pi, . . . ,pi^} be the set of monic irreducible factors of the minimal polynomial of A. In 
view of our assumptions we have i{p) < n — 2 for every p G P. If i{p) < n — 2 for all p G P, or all 
p £ P are linear (in which case their ^"'"-stabilizer is trivial), then Theorem 3.5 applies. Suppose then 
that i{p) = n — 2 for some p £ P, and some q £ P is not linear. Then either n = 4, p = q has degree 
2, and ^ ~ Cp © Cp, in which case Proposition 2.4 applies, or else p is linear, q has degree 2 and the 
elementary divisors of A are all linear except for q, in which case Theorem 3.5 also applies. ■ 

4 The case when all ^-elementary divisors of A are equal 

Let A £ Mn{F) and let P = {pi, ■ ■ ■ ,Pm} be the set of monic irreducible polynomials of the minimal 
polynomial of A. Recall that A is semisimple if its minimal polynomial is square free. This means 
that for p £ P the p-elementary divisors of A are p, . . . ,p. We will say that A is homogeneous if, given 
any p £ P, there exist positive integers i = i{p) and k = k{p) such that the p-elementary divisors of 
A are p*, . . . ,p*, with k repetitions. 

R. Guralnick (private communication) proved that every semisimple matrix is of polynomial type. 
Using his idea, we have extended this result to all homogeneous matrices. 

Lemma 4.1 Let m > 1 and let f = oq + aiX -\ + 0^-1^™""^ + X"^ £ F[X], where oq 7^ 0. 

Given k > 1, let B £ Mkm{F) be a matrix made up of k"^ blocks of size mxm, where the diagonal and 
first superdiagonal blocks are equal to Cj, and all other blocks are 0. Let ei, . . . , Ckm be the canonical 
basis of the column space V = F^^ . Then the minimal polynomial of eij^_i\jn_^i relative to B is f^. 
In particular, B is cyclic with minimal polynomial f . 

Proof. By induction on k. The case A; = 1 is clear. Suppose k > \ and the result is true for k — \. We 
have V = U ®W ^ where U (resp. W) is the span of the first {k — l)xm (resp. last m) of the canonical 
vectors ei, . . . , Ckm- Note that U is i?-invariant. Let ui, . . . , Um be the last m of the stated spanning 
vectors of U and let ui, . . . , fm be the stated spanning vectors of W . In particular, vi = e(,fc_i)„j_|_i. 

Let g be the minimal polynomial of vi relative to B. Clearly the conductor of vi into U relative 
to B IS f, so g = hf for some h £ F[X]. On the other hand, we have 

Bvi =U2+ V2, B'^Vi = lis + ^3, . . . , -B™"^^! = Um + Vm, 

B"^vi = -{aoui + aiU2 H h Om-iWm) - {aovi + 01^2 H h am-lVm), 

so 

fiB)vi = (S™ + a„_iB™-i + • • • + aiB + aoI)vi = -aoui. 



It follows that 

= g{B)vi = h{B)f{B)vi = -aoh{B)ui. 

Since oq 7^ we infer h{B)ui = 0. It follows by inductive hypothesis that f^~^\h. But these are monic 
polynomials of the same degree, so / = /i, whence g = f ■ B 

Theorem 4.2 Every homogeneous -and so every semisimple- matrix is of polynomial type. 

Proof. Let A G Mn{F) and let P = {pi, • • • ,Pm\ be the set of monic irreducible factors of the minimal 
polynomial of A, so that A ~ Ap^ ® • • • ® Ap^, where for each p £ P there exists i{p), k{p) > 1 
such that Ap = C i(p) © • • • © C i(p), with k{p) summands. Let p £ P. Suppose first p{0) / 0. Let 
Bp be the matrix B of Lemma 4.1 constructed upon / = p^^^' and k = k{p). By construction Ap 
commutes with Bp, which is cyclic with minimal polynomial pHP)'^(p) by Lemma 4.1. Suppose next 
p(0) = 0, that is, p{X) = X. Now Ax ~ Cx^ © • • • © Cxt, with k repetitions, so by Proposition 
2.4 we have Ax ~ g{Cf), for g = X^ and / = X^^. In particular. Ax commutes with a cyclic 
matrix Bx with minimal polynomial X^"^. Since Sp^, . . . , Bp^ are cyclic with pairwise relatively prime 
minimal polynomials, it follows that Bp-^ © • • • © Bp^ is also cyclic. As this matrix commutes with 
Ap-^ © • • • © Ap^, the result follows from Lemma 3.1. ■ 

5 Examples 

Let F be an arbitrary finite field, say F = {ai, . . . , am}, where oi = 0. Let A G Mn{F) be the matrix 
of size n = m + 3 equal to the direct sum of the following t = m+1 Jordan blocks: 

A = J3(0) © Ji(0) © Ji(a2) © • • • © Ji(am). 

We claim that A is not of polynomial type. Suppose, by way of contradiction, that A ^ 9{Cf) for 
some f,gG F[X], with / monic of degree m + 3. For 1 < i < m let /ij = gcd(5f — ai,f). It follows 
from Theorem 2.3 that hi is linear for i > 1 and of degree 2 if i = 1. Note that hi, ... , hm are pairwise 
relatively prime, since so are g — ai, . . . ,g — am- Let h = hi and set Ui = deg gcd{g^ , f), i > 1. By (3) 
we have Ui = deg gcd{h^ , f) for alH > 1. By Theorem 2.3 

ui = 2, U2 = 3, U3 = 4. 

Since h has degree 2 we see from U2 = 3 that h cannot be irreducible. Then either h = r^ ov h = rs, 
where r, s G F[X] are distinct, linear and monic. li h = r^ then n2 = 3 forces r to have multiplicity 
3 in /, against u^ = 4. If /i = rs then the m + 1 distinct monic linear polynomials r, s, /12, . . . , hm of 
F[X] are factors of /, which is absurd as \F\ = m. This proves our claim. 

This shows that over any given finite field there is a matrix that is not of polynomial type. Since, 
\F\ < n — 2 and \F\ < t, our example shows that the bounds given in Theorems 3.2 and 3.8 are best 
possible. Moreover, our example also proves that Lemmas 3.6 and 3.7 cannot be pushed any further 
in the same direction. Furthermore, the presence of the X-elementary divisors X"^, X in A shows that 
our conditions on Theorem 4.2 are well placed. In this regard, we believe that even the following 
might be true: if p G F[X] is any monic irreducible polynomial then there exists A G Mn{F), all 
whose elementary divisors are powers of p, that is not of polynomial type (see Section 6 for the case 
p = X). 
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R. Guralnick asks whether a semisimple matrix commutes with a cychc semisimple matrix. In 
the infinite case, or under the weaker hypotheses of Theorem 3.5, we see that any A G Mn{F) will 
commute with a cyclic matrix B whose elementary divisors have the same exponents as those of A. In 
particular, B is semisimple if so is A. We do not know the answer in general. In this regard, consider 
the following statement. 

(S) Let F be a field and let P = {pi, • • • , Pm} be a set of monic irreducible polynomials in 
F[X]. Let di,...,dm be a list of positive integers (with possible repetitions). Then there exist 
polynomials gi, ■ ■ ■ ,gm G F'lX] such that each gi has degree di, each pi{gi{X)) is multiplicity free, and 
the polynomials pi{gi{X)), . . . ,Pm{gmiX)) are pairwise relatively prime. 

Note that if (S) is true when F is a finite field then every semisimple matrix commutes with a 
cyclic semisimple matrix by Proposition 2.4 and Lemma 3.4. For instance, 

• F = F2, p = X^ + X + 1 and A = Cp ® Cp. Take g = X^ + X. Then p{g{X)) = X^ + X + 1 is 
irreducible and A ^ g{Cp(g(^x))) by Proposition 2.4. 

• F = F3, p = X^ + 1 and A = Cp e Cp e Cp e Cp. Take g = X*. Then p{g{X)) = X^ + 1 is 
multiplicity free (look at its derivative) and A ^ g{Cp(^g(^x))) by Proposition 2.4. 

• F = F4, p = X^ + X + a, where a^ + a + I = {), q = X"^ + X + {a + I), A = Cp ® Cp ® Cq and 
B = Cp®Cp®Cq® Cq. Take g = X"^ + X. Then p{g{X)) = X'^ + X + a, which is multiplicity free 
and not a multiple of q, and q{g{X)) = X'^ + X + (a + 1) is multiplicity free and relatively prime to 
p{g{X)). Hence A (resp. B) commutes with a cyclic and semisimple matrix with minimal polynomial 
p{g{X))q{X) (resp. p{g{X))q{g{X))). 

Example 5.1 Let F = Fp, where p is a prime, and consider an F+-invariant irreducible polynomial 
of degree p, which must necessarily be of the form q{X) = X^ — X — a, where ^ a £ F. We claim 
that A = Cq2 © Cq is of polynomial type. 

We consider first the case p odd. Make the initial assumption that a = 1. Let g{X) = X'^ and 
consider the polynomial h{X) = q{g{X)) = X'^'^ — X'^ — 1. We claim that h is the product of 2 
irreducible relatively prime polynomials r and s of degree p. Since h'{X) = —2 7^ 0, it is clear that 
h has no repeated irreducible factors. Now q{X) = {X — a){X — a^) ■ ■ ■ {X — a^ ) for a root a in 
some extension of Fp. Thus 



Clearly 



h{X) = q{X^) = {X^ - a){X^ - a^) ■ ■ ■ (X^ - a*""'). 



1 = aa^ ■ ■ ■ a^ 



Since l+p + p^ + ---+j)P ^is odd, a has a square root, say /3, in the multiplicative group of the field 
Fp[a]. Thus 

hiX) = iX- /3)(X - /3P) ... (X - /^P""') x{X + f3){X + /3P) . . . (X + ^p"'") = r{X) x s{X) 

The polynomials r, s are clearly invariant under the Frobenius automorphism x 1— )■ x^, so they have 
coefficients in Fp. They are irreducible, since the degree of /3 and — /3 is p (the degree cannot be 1 
since /3^ = a, and there is simply no other option for this degree). This establishes the claim. 

We now let f{X) = r^{X)s{X). Then f{X) divides q'^{g{X)) = h{Xf = r{Xfs{Xf but not 
q{g{X)) = r{X)s{X). Since q{X) is irreducible, the minimal polynomial of g{Cf) is q'^ and the 
remaining invariant factor has no other option than being q. Thus A '^ g{Cf). Now in the general 
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case q = X^ — X — a, we claim that the same choice of / works with g{X) = aX'^. Indeed, q{g{X)) = 
aX'^P - aX'^ -a = a{X^P -X -I). Thus, by the previous case, / divides q^{g{X)) but not q{g{X)), 
as required. 

As an illustration, when p = 3 and q{X) = X^ — X - 1 we have h{X) = X^ — X"^ — 1 = 
{X^ -X^ - X - 1){X^ + X2 - X + 1) = r{X)s{X). For q{X) = X^ - X + 1 we take g{X) = -X"^. 

In the case p = 2 we have q{X) = X'^ + X + I and we just take f{X) = q{X)^ and g{X) = X'^ . 
Then / divides q^{g{X)) = q*{X) but not q{g{X)) = q^{X), as required. 

6 The case of a nilpotent matrix 

Here we focus on a matrix A G Mn{F) that is equal to the direct sum of Jordan blocks of various sizes 
corresponding to the same eigenvalue. We wish to see if A is of polynomial type. In view of our next 
result we may assume that this eigenvalue is 0, i.e. A is nilpotent. 

Lemma 6.1 Suppose that A is similar to g{Cf). Then A + al is similar to {g + a){Cf). 
Proof. We have YAY-^ = g{Cf), for Y G GL„(F), so Y{A + aI)Y-^ = {g + a){Cf). ■ 

Assume henceforth that A is nilpotent and write A = l^'^'j • • • \n^^"'\ c{i) > 0, to mean that A is 
the direct sum of c(i) 0-Jordan blocks Jj of size i, where 1 • c(l) + 2 • c(2) + • • • + n • c{n) = n. 

There may be repetitions amongst the non-zero c(l), . . . , c{n) and this will play an important role. 
To account for this, we set r(i) = \{j \ c{j) = c{i)}\ for all 1 < i < n such that c{i) / 0. 

Suppose that F[X] has at least r{i) monic irreducible polynomials of degree c{i), for all 1 < i < n 
such that c(i) / 0. We may then select monic polynomials 9i, . . . , g™ € F[X] such that deg(gi) = c(i), 
with qi irreducible if c{i) ^ 0, and all polynomials qi,. . . ,qn different from 1 are distinct from each 
other. Using Theorem 2.3 it is easy to see that 

A ~ g{Cf), for g = qiq2 ■ ■ ■ qn and f = q\ql • • • g". (4) 

Observe how (4) agrees with Proposition 2.4 when all but one of the indices j satisfy c{j) = 0, i.e. 
when A = i'^ (from now on we agree to omit j if and only if c{j) = 0, as well as c{j) if and only if 

c(j) = 1). 

Let us concentrate on the case when i^ is a finite field (otherwise Theorem 3.8 settles the matter). 
For each d > 1, let N{d) be the number of monic irreducible polynomials of degree d in -F[X]. There 
is a formula (see [J2]) due to Gauss for N{d). Using Gauss' formula we may verify whether or not 

N{c{i)) > r{i), when c{i) / 0. (5) 

However, as seen below, condition (5) is not necessary for A to be of polynomial type, as one may 
establish this fact using other polynomials f,g. It is however necessary in some cases, and this allows 
us to provide examples of nilpotent matrices that are not of polynomial type. 

Suppose first A = l\2\i, i.e. A = Ji J2 Jj. By Theorem 2.3 A ~ giCf) with g = r'^s, f = r^s^ 
and r ^ s G F[X] linear. Here r(l) = 3 and -F2[^] has only 2 linear polynomials, but we can still 
show that A is of polynomial type using a different method than (4). 

Suppose next A = 1|3|4, i.e. A = Ji J3 J4 G Ms{F). By Theorem 2.3 A ~ g{Cf) with g = r^s, 
f = r'^s and r ^ s £ F[X] linear. The same comments made above apply. 

12 



This cannot be continued. Indeed, suppose next F = F2 and A = 1|3|5 G Mg{F). In this case 
iV(l) = 2 < 3 = r(l). We claim that A is not of polynomial type. To see the claim suppose, by way 
of contradiction, that A ~ diCf) for some f,g& -^[^]) with / monic of degree 9. Let h = gcd{g,f) 
and set Ui = deg gcd{g^, f), i > I. By (3) we have Ui = deg gcd(/i*, /) for all i > 1. By Theorem 2.3 

ui = 3, U2 = 5, U3 = 7, ti4 = 8, ti5 = 9 

Thus /i has degree 3. From ti2 = 5 we see that h cannot be irreducible. Suppose h = rs, with 
r,s £ F[X] irreducible of degrees 2 and 1, respectively. Then U2 = 5 forces s to have multiplicity 1 in 
/, but then ^4 = 8 becomes impossible. Suppose next h = r'^s, with r, s G F[^] distinct and linear. 
Then U2 = 5 forces s to have multiplicity 1 or r multiplicity 3 in /. In the first case U4 = 8 forces r 
to have multiplicity 7 in /, which contradicts u^ = 9. In the second case U3 = 7 becomes impossible. 
This proves the claim. 

Using the preceding ideas we easily see that every nilpotent matrix of size n < 8 over any field is 
of polynomial type. The example of A = 1|3|5 over F2 shows that this cannot be extended to n = 9. 

Using exactly the same method we also verified that if F = i^3 and A = 1|4|7|10 then A is not of 
polynomial type. This seems to indicate that if F has size m and A = l|?7i + 1| • • • |m^ + 1 then A is 
not of polynomial type. 

As seen above, while (4) gives a pleasant formula, it is not sufficiently general to deal with all 
cases in which A is of polynomial type. This can be achieved, at least in principle, using our work 
from section 2, as hinted in our treatment of the case A = 1|3|5. Indeed, Claim 2.1 shows that the 
similarity classes of nilpotent matrices A in Mn{F) of fixed nilpotency class t, with 1 < t < n, are 
in bijective correspondence with the collection of all strictly increasing sequences of natural numbers 
si < S2 • • • < Sj = n. If N{A) is the nullity of A, the correspondence is 

[A] ^ si = N{A) <S2 = N{A^) <... <st = iV(A*). 

By Theorem 2.3 such A will be of polynomial type if and only if there exist f,g& -^[^]) with / monic 
of degree n, such that 

deg gcd{g\f) = Si, l<i<t. 

By (3) we may assume that g\f. Let / = r"^ • • • r^*" and g = r^^ ■ ■ ■ r^*" be the prime decompositions 
of / and g, where 1 < ai, 6j < Oj and di = deg(rj) for all 1 < i < A;. It is necessary to require that 
1 < 6j for all 1 < i < A; since g{CfY = implies f\g^. 

Thus, A will be of polynomial type if and only if there exist k > 1 distinct monic irreducible 
polynomials ri, . . . , r^ in F[X] of degrees di, . . . ,dk and integers 1 < 6j < Oj, for all 1 < i < /c, such 
that 

min{i6i, ai}di + • • • + min{i6fc, ak}dk = Si, 1 < i <t. 

The first of these equations says that deg(5f) = si and and the last that f\g^. 

7 Further consequences 

We present applications of our work in section 2 which were postponed to deal with matrices of 
polynomial type. 

Our first application, to linear algebra itself, follows easily from our work in section 2. 
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Theorem 7.1 Let g G F[X] and let A G Mn{F) have minimal polynomial f . Let P be the set of 
minimal polynomials of g modulo the monic irreducible factors of f . Then g{A) is semisimple if and 
only if gcd{p{g{X)), f) = gcd{p'^{g{X)), f) for all p G P. Moreover, g{A) is cyclic if and only if A 
is cyclic and deg gcd{p{g{X)), f) = deg p for all p £ P. Furthermore, all eigenvalues of g{A) are 
in F if and only if P = {X — ai, . . . ,X — am} for some oi, . . . , Om £ F, in which case these are the 
eigenvalues of g{A), with g{A) diagonalizable if and only if gcd{g — ai, f) = gcd{{g — ai)'^,f) for all 
1 < i < m. 

We next give an application to simple algebraic field extensions. Let F be a field, let / G F[X] be 
monic of degree n and let a be algebraic over F with minimal polynomial /. Then K = F[a] is a field 
with F-basis B = {l,a, . . . ,a"~^}. Given /3 £ K one often needs to find the minimal polynomial, 
trace, norm, or inverse of /3 (in this last case if /? 7^ 0). There are several ways of doing this. One way 
is to look at the regular representation i : K ^ Endir(i^), where ifs is the map "multiplication by /3" 
and find the previous information about ijs, using the matrix MsiH-p) of Up relative to B. We wish 
to find a closed formula for Mb{I^p) that can be of use for all these purposes. We have /3 = g{(y) for 
a unique g G F„[X], and it is clear that Msiia) = Cj. Therefore, invoking Lemma 3.1 of [GS2], we 
obtain 

Msiif,) = Msiig^a)) = Msigiia)) = giMBiia)) = 9{Cf) = {[g] Cf[g] . . . q-'[g]). (6) 

Thus Msi^p) can be computed by applying the successive powers of C/ to [g\. It is also clear that (6) 
remains valid if we replace F by an arbitrary commutative ring with identity i?, as long as / G R[X] 
satisfies /(a) = and g{a) 7^ for all g G i2„[X]. 

The matrix g{Cf) = {[g] Cf[g] . . . C?" [g]) can be used to answer any questions about /3 = g{a). 
In the field case, where the minimal polynomial p of /? is irreducible, our work from Section 2 then says 
that p is the minimal polynomial of g modulo / (this is how one finds p in practice, perhaps without 
noticing). The trace and norm of /? are those of g{Cf). If /5 7^ then {3^^ = [h], where h G -Fn[^] and 
[h] is the first column of g{Cf)~^. Of course, g{Cf)~^ = h{Cf), and h can also be computed using 
Euclid's algorithm. If we actually compute p first, as the minimal polynomial of g modulo /, we can 
find 13^^ in a third way, namely by using the relation p{f3) = and isolating the independent term. 

It turns out that Lemma 2.2 and some of its consequences hold in the context of rings. For the 
remainder of this section we assume that R is an integral domain with field of fractions F. Recall that 
R is said to be integrally closed if any root in F of a monic polynomial in R[X] actually lies in R. 

Lemma 7.2 Suppose that R is integrally closed. Let f,g £ F[X], with f monic, and let d = 
gcd{f,g) G F[X]. Then d G R[X]. 

Proof. Let K be an algebraic closure of F. Then d = {X — ai) ■ ■ ■ {X — am) for some Oj G K. The Oj 
are roots of / and hence they are integral over R. It follows that the coefficients of d are integral over 
R. These coefficients are in F and R is integrally closed, so d G R[X]. ■ 

We now extend all of the notation used in Lemma 2.2 to R. Assume that z = gcd(y{g(X), /), as 
calculated in -F[X], actually belongs to R[X] (this is guaranteed if R is integrally closed by Lemma 
7.2). Then Lemma 2.2 is true in this generality. Thus, [p] G V{y) if and only if h\p, and V{y) is a free 
i?-module of rank d = deg z with i?-basis [h], [Xh], . . . , [X /i]. The same proof works. 

The following is a consequence of the preceding remarks and Lemma 2.2 applied to the case g = X 
and y{X) = {X - a)'. 
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Corollary 7.3 Let a be a root of f in R of multiplicity at least i, denote the corresponding gener- 
alized eigenspace of Cf by E = {v £ V \ {Cf — aiyv = 0}, and set h = f /{X — a)*. Then E is a free 
R-module of rank i spanned by [h], [hX], . . . , [X^~^h]. 

The following is a consequence of Corollary 7.3 applied to the case i = 1. 

Corollary 7.4 Let a be a root of f in R, let E be corresponding eigenspace of Cf and set h = 
f /{X — a). Then E is a free R-module of rank 1 spanned by [h]. 

Note that the field cases of Corollaries 7.3 and 7.4 appear in different but equivalent forms in [B], 
[Gi] and [R]. 

The following is a consequence of Lemma 2.2 applied to the case y{X) = X and arbitrary g G i?[X]. 

Corollary 7.5 Let N be the nullspace of g{Cf). Suppose that z = gcd{g,f) G F[X] actually 
belongs to R[X] (this is guaranteed if R is integrally closed) and let d be its degree. Set h = f jz. Let 
p G Rn[X]. Then [p] £ N if and only if h\p. Moreover, N is a free R-module of rank d = deg z and, 
ifd>0, then [h], [Xh], ..., [X'^-^h] is an R-basis of N . 

8 Companion matrices and subalgebras of Mn{R) 

In this section R is an integral domain with field of fractions F. Let f,g£ Li[X] be monic polynomials 
of degree n, with respective companion matrices C,D £ Mn{R)- We further let d = gcd{f,g) £ E[X], 
whose degree will be denoted by m. Note that d G R[X] provided R is integrally closed, by Lemma 
7.2. We also let h = f/d£ F[X]. 

We let R{C,D) stand for the subalgebra of Mn{R) generated by C and D, and R[C,D] for 
the i?-span of all matrices C^D^, where < i,j < n. It is shown in Corollary 6.3 of [GS2] that 
R{C, D) = R[C, D]. Moreover, Theorem 9.2 of [GS2] shows that if i? is a unique factorization domain 
then i?((7, Z?) is necessarily a free i?-module of rank n + (n — m)(n — l), and Theorem 9.3 of [GS2] gives 
a presentation for R{C, D) in terms of C, D. As indicated in the Introduction, we wish to extend these 
results to the more general setting of integrally closed domains as well as furnishing a counterexample 
to the freeness of R{C,D) when R = Z[v5]. 

Theorem 8.1 The n+ {n — m){n — 1) elements 

L, C, ..., C'-^D, CD, ..., C"-™-^!?, ..., D''-\CD''~\ ..., c^-^-^i^"-^ 

of R{C, D) are linearly independent over F , and hence over R. 

Proof. Suppose that po £ -^nl-'^] and pi, ...,pn~i G Fn-m[X] satisfy 

Po{C)L + pi{C)D + ---+pn-i{C)D^-^ = 0. (7) 

We wish to show that po,Pi, •••,Pn-i = 0. Since deg po < deg / and deg pi < deg /i for i > 1, it 
suffices to show that f\po and h\pi, ..., h\pn-i. Since 

Pi{C)D + ■■■ +p„_i(C)D"-i ^ -po{C), 



15 



it follows that pi{C)D + • • • + pn-i{C)D"' ^ commutes with C. Thus 

Pi{C){CD - DC)+pi{C){CD^ - DC^) + ■■■ + pn-i{C){C D''-^ - D''-^C) = 0. 

Now 

{CD - DC)ei = ■■■ = {CD'^-'^ - D'^-'^C)ei = 0, 

so for s = / — (7 G Rn [X\ we have 

= pn-i{C){CD^~^ - D^'^C)ei = Pn^i{C)[s\. 

By Lemma 3.2 of [GS2], / divides Pn-is and hence Pn-ig- It follows that h divides Pn-i, so 
Pn-i = 0. Proceeding like this with 62, ..., e^-i we see that Pn-2 = Pn-3 = ••• = Pi = 0. Going back 
to (7) shows that f\po, so po = 0. ■ 

Corollary 8.2 The n + (n — m){n — 1) matrices 

I, C, ..., C7"-i, D, CD, ..., C"~™-1d, ..., D''-\CD''-\ ..., C7«-™-ii:)"-i 

span R{C,D) if and only if d £ R[X]. In particular, R{C,D) is a free R-module of rank n + (n — 
m){n — 1) provided d G ^[-'l^], and in particular if R is integrally closed. 

Proof. Recah first of aU that R{C, D) = R[C, D]. 

Suppose first d £ R[X]. It follows that h also belongs to i?[X]. By Lemma 9.1 of [GS2], if 
p G R[X] then dividing p by /i we may write p{C)D in the form q{C) + z{C)D, where q £ R[X] and 
z G Rn~m[X], SO the listed matrices span R{C,D). 

If d ^ R[X] then h ^ R[X]. The linear independence of the listed matrices and Lemma 9.1 of [GS2] 
show that C^~"^D is an F-linear combination of the listed matrices but not an i?-linear combination 
of them. ■ 

Corollary 8.3 Let R he an integral domain. If R{C,D) is an R-free module then its rank must be 
n+ {n — m){n — 1). 

Proof. An i?-basis of R{C, D) is an F-basis of F{C, D), which has dimension n + (n — m){n — 1) by 
Corollary 8.2. ■ 

Theorem 8.4 Suppose d G R[X]. Let the polynomials Pi, . . . ,Pn~i G i?[X, y] be defined as in 
Theorem 7. 1 of [GS2] or, more generally, be arbitrary while satisfying 

D^C = Pj{C,D), j = l,...,n-l. 

Then the algebra R{C, D) has presentation 

{X,Y\ f{X) = 0, g{Y) = 0, h{X){X -Y) = Q, Y^X = Pj{X,Y), j = l,...,n). 

Proof. Write $7 : R{X, Y) — )■ R{C, D) for the natural i?-algebra epimorphism that sends X to C 
and Y to D. Let K be the kernel of i7. Set S = R{X,Y)/K and let A and B be the images of 
X and Y in S. Then g{B) = and B^ A = Pj{A,B), j = 1, . . . ,n — 1. It follows from Lemma 6.1 
of [GS2] that S = R[A,B]. From f{A) = and g{B) = it follows that S is i?-spanned by A'B^, 
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< i, j < n — 1. Thus the relation h{A){A — B) = allows R[A, B] to be spanned by the reduced list 
of n + (n — m){n — 1) matrices: 

I, A,..., A''-\B, AB,..., A"-™-^^, . . . , B''-\AB''-\ ..., ^«-™-i5"-i. 

Let A : S" — )■ R{C, D) be the map induced by il. We wish to show that A is injective. Let 
t G ker A. Then t is a linear combination of the above listed matrices. Their images under A are 
linearly independent by Theorem 8.1, so t = 0. ■ 

Our next example shows that R{C, D) = i?[C, D] need not be a free i?-module. 

Example 8.5 Let R = Z[-v/5], with field of fractions F = Q[-v/5]. Let n = 2 and set 

g = (X-{l- \/5)/2)(X - (1 + ^/5)/2) = X^-X-le R[X], 

f = {X -{1- V5)/2){X - (5 + \/5)/2) = X^ -3X -Vbe R[X]. 
Then the subalgebra R{C,D) of M2{R) is not a free R-module. 
Proof. This will proceed in a number of steps. 

(51) Let L be the ideal of R generated by 2 and 1 — a/S and let J be the subset of R formed by 
all a + 6v5 such that a, 6 € Z have the same parity, i.e. a = b mod 2. We claim that L = J and that 
R/L = Z2 is a field with 2 elements. 

First of all J is clearly a subgroup of R that is stable under multiplication by \/^. It follows that 
J is an ideal of R. Since 2, 1 — \/5 G J, we conclude that L C J. On the other hand, all 2a + 26\/5 
are clearly in L as well as all 2a + 1 + (26 — l)\/5 = 2(a + 6\/5) + 1 — \/5. Thus J C L, so L = J. 
Finally, v5 = 1 mod L, so a + 6v5 = a + 6 = 0, 1 mod L, as required. 

(52) R is not integrally closed, as (1 — v5)/2 G -F is a root of the monic polynomial g G i?[X]. 

(53) The companion matrices of / and g are 

C={' ^V D ^' ' 



.13/' VI 1 

Observe that 



CD 



\/5 \/5' 
3 4 



Note that d = gcd(/, g) G F'iX] is X — (1 — v5)/2 and d does not belong to i?[X]. Its degree is m = 1. 
By Theorem 8.1, or by inspection, the matrices I,C,D are linearly independent. By Corollary 8.3, if 
R{C, D) is a free i?-module its rank must be 3. 

(S4) Set M = R[C, D] = R{C, D) and K = R/L, both of which are i?-modules. Then 

H = RomR{M,K) 

is an i?-module via 

{rf){m) = r{f{m)). 
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Since L annihilates K it also annihilates H, which makes H into an i?/L-niodules, i.e. a X-vector 
space. 

(55) Here we prove that the dimension of H over K is 4. More explicitly, we show that any R- 
homomorphism M ^ K can be arbitrarily defined on /, C, D, CD (taking a value of or 1 in K) and 
then extended i?-linearly to M. This is true in spite of the fact that CD is an F- linear combination 
of /,C,L». 

Indeed, we know that M is i?-spanned by /, C, D, CD. Let a;i, X2, X3, X4 G i^ be any values. Then 
the necessary and sufficient condition for the existence of an i?-homomorphism f : M ^ K satisfying 

f{ail + a2C + a^D + a^CD) = aixi + 02X2 + 03X3 + 04X4, 

for all ai G i?, is that 

ail + a2C + a-i,D + a^CD = ^=^ aixi + 02X2 + 03X3 + 04X4 = 0. 

The curious fact here is that if ail + a2C + a^D + a^CD = then all Oj G L, which clearly implies 
aixi + 02X2 + 03X3 + 04X4 = since L annihilates K. 

Thus, we are reduced to show that ai £ R and ail + 026* + a^D + a^CD = implies Oj G L, i.e. 
the linear relations between /, C, -D, CD all have coefficients in L. 

To prove this we first work over F. We look for all 4-tuples (ai, 02, 03, 04) G -F^ such that ai/ + 
02(7 + 03!) + a^CD = 0. Since /, C, D are linearly independent and CD is an F-linear combination 
of /, C, I? (by Lemma 9.1 of [GS]) it follows that the space of all these 4-tuples is a subspace of F^ of 
dimension 1. Thus all solutions are scalar multiples of a fixed solution. One such solution is 

\/5/ + (1 - \/5)/2C + (5 + V5)/2D -CD = 0. 

Let c (z F. Scaling the above solution by c yields a solution in i?^ if and only if c = a + 6\/5 G R (look 
at the coefficient of CD) and c(5 + \/5)/2 G R (look at the coefficients of C and D). Now 

c(5 + \/5)/2 = (a + 6\/5)(5 + \/5)/2 = (5(a + 6) + (a + 56)\/5)/2, 

so the last condition can be replaced by a = 6 mod 2, i.e. c G L. It now only remains to show that 
if c G L then c\/5, c(3 - (5 + \/5)/2), (5 + \/5)/2, -c G L. This reduces to checking c(5 + \/5)/2 G L, 
where c = a + 6\/5 G L. If a = 2fc, 6 = 21 are both even then c(5 + \/5)/2 = (/c + /\/5)(5 + \/5) G L 
since 5 + v5 G L. If a = 2/c + 1, 6 = 2/ + 1 are both odd then 

c(5 + \/5)/2 = (A: + /V5)(5 + ^/b) + {I + ^/b){b + \/5)/2 = (/c + /^)(5 + \/5) + 5 + 3\/5 G L. 

(56) M = R{C, D) is not a free i?-module. 

Suppose it is free. By (S3) M has rank 3. Then H, as defined in (S4), would easily be seen to 
have dimension 3. But it has dimension 4 by (S5). This completes the proof. ■ 

9 Viewing Mn{R) as an i?[X]-niodule 

We maintain the notation introduced at the beginning of section 8. Having dealt so extensively with 
elementary divisors, it would be unfair not to translate the content of Theorem 8.1 into the language 
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of invariant factors. Note first of all that the uniqueness part of the invariant factor theorem holds in 
much greater generality than its existence counterpart. In particular it is valid for any commutative 
ring with identity (as shown in [K]). We can make Mn{R) into an R[X] module via C as follows: 

p{X)-A = p{C)A, p{X)(^R[X],A£Mn{R). 

Clearly R[C] C R{C,D) = R[C,D] are i?[X] -sub modules of Mn{R), so we can form the quotient 
module M = R{C, D)/R[C]. We further let Di = D + R[C] G M, . . . , Dn-i = L*""^ + R[C] G M. 

Theorem 9.1 Suppose that h G R[X] (again, this is automatic if R is integrally closed). Then the 
R[X]-module M has the following cyclic decomposition: 

M = R[X]-Di®---R[X]-Dn-i, 

where the annihilating ideals are R[X]h, . . . , R[X]h, with n — 1 repetitions, i.e. the invariant factors 
of M are h, . . . ,h, with n — 1 repetitions. 

Proof. This follows immediately from Theorem 8.1, its proof, and Lemma 9.1 of [GS2]. ■ 

Recah from Corollary 5.2 of [GS2] that M„(i?) = R{C, D) if and only if the resultant R{f, g) of / 
and g is unit in R. 

Theorem 9.2 If R(f,g) is a unit in R then the R[X]-module Mn{R) has the cyclic decomposition: 

Mn{R) = R[X] ■ C e R[X] •£)•••© R[X] ■ Z)"-\ 

where the annihilating ideals are R[X]f^ R[X]f, . . . , R[X]f , with n repetitions, i.e. the invariant factors 
of Mn{R) are f,f,...,f, with n repetitions. 

Proof. This follows immediately from Theorem 8.1. ■ 

Except for the use of D, Theorem 9.2 is really about the action of C on Mn{R) and how this 
yields the invariant factor / repeated n times. There is no need to use companion matrices for this: 
it remains valid in greater generality. For instance, in the field case, the invariant factors of M„(F) 
as an ^[Xj-module via a given A G Mn{F) are just the invariant factors of A repeated n times. 

Theorem 9.3 Let A G Mn{R) have monic invariant factors 1 7^ fi\---\fm- This means that 
A ~ Cf^®- ■ ■®Cf^ or, equivalently, the column space R^ , viewed as an R[X]-module via A, decomposes 
as the direct sura of cyclic submodules with annihilating ideals R[X]fi, . . . , R[X]fm- 

Make Mn{R) into an F[X]-module via A by declaring p{X)B = p{A)B, for p{X) G R[X] and 
B G Mn{R) ■ Then Mn{R) has invariant factors fi, . . . , fi, f2, ■ ■ ■ , f2, ■ ■ ■ , fm, ■ ■ ■ , fm, where each fi is 
repeated n times. 

Proof. The F[X]-module Mn{R) is just the direct sum of n submodules isomorphic to i?". ■ 
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